# llama.cpp compatibility
## MiniCPM4-8B-Q8_0-GGUF
- **License:** Apache-2.0
- **Description:** A model converted from openbmb/MiniCPM4-8B to GGUF format via llama.cpp, suitable for local inference.
- **Tags:** Large Language Model · Transformers · Multilingual
- **Author:** AyyYOO (160 downloads · 2 likes)
## inf-o1-pi0-GGUF
- **Description:** A quantized version of the infly/inf-o1-pi0 model, built with llama.cpp's imatrix quantization and supporting multilingual text generation.
- **Tags:** Large Language Model · Multilingual
- **Author:** bartowski (301 downloads · 1 like)
## Qwen3-32B-GGUF
- **License:** Apache-2.0
- **Description:** A quantized version of Qwen/Qwen3-32B, produced with llama.cpp and offered in multiple quantization types for different hardware requirements.
- **Tags:** Large Language Model
- **Author:** bartowski (49.13k downloads · 35 likes)
## Rombo-LLM-V3.1-QWQ-32b-GGUF
- **License:** Apache-2.0
- **Description:** Rombo-LLM-V3.1-QWQ-32b is a 32B-parameter large language model, processed with llama.cpp's imatrix quantization and offered in multiple quantization versions to accommodate different hardware.
- **Tags:** Large Language Model
- **Author:** bartowski (2,132 downloads · 5 likes)
## Llama3-8B-1.58-100B-tokens-GGUF
- **Description:** A GGUF-format model converted from the Meta-Llama-3-8B-Instruct and HF1BitLLM/Llama3-8B-1.58-100B-tokens models, suitable for llama.cpp inference.
- **Tags:** Large Language Model · Transformers
- **Author:** brunopio (2,035 downloads · 16 likes)
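The GGUF files listed above follow the same general workflow: convert a Hugging Face checkpoint to GGUF with llama.cpp's conversion script, then quantize it to the desired type. A minimal sketch using llama.cpp's own tools; the model directory and output file names are illustrative, and the commands assume a local checkout and build of llama.cpp:

```shell
# Convert a locally downloaded Hugging Face checkpoint to an F16 GGUF file.
# convert_hf_to_gguf.py ships in the llama.cpp repository.
python convert_hf_to_gguf.py ./MiniCPM4-8B --outfile minicpm4-8b-f16.gguf

# Quantize the F16 GGUF to Q8_0 (other types, e.g. Q4_K_M, trade size for quality).
./llama-quantize minicpm4-8b-f16.gguf minicpm4-8b-q8_0.gguf Q8_0

# Run local inference on the quantized file.
./llama-cli -m minicpm4-8b-q8_0.gguf -p "Hello" -n 32
```

imatrix-quantized releases (such as bartowski's) additionally pass an importance matrix, computed with llama.cpp's `llama-imatrix` tool over a calibration dataset, to `llama-quantize` via `--imatrix`.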